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Abstract 


The  theory  of  positive  dependence  notions  cannot  yield  useful 
results  for  some  widely  used  distributions  such  as  the  multinomial , 
Dirichlet  and  the  multivariate  hyper geometric.  Some  conditions  of 
negative  dependence  that  are  satisfied  by  these  distributions  and  which 
have  practical  meaning  are  introduced.  Preservation  results  for  some 
of  these  concepts  are  derived.  Useful  inequalities  for  some  widely 
used  distributions  are  obtained.  Results  of  Mallows  (1969)  that  apply 
to  the  multinomial  distributions  are  extended  to  more  distributions. 
Examples  are  listed. 
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Key  words:  Multinomial,  Dirichlet,  multivariate  hypergeometric,  multivariate 
normal,  PF-  densities,  total  positivity,  negative  and  positive  dependence, 
new  better  than  used,  stochastic  ordering.  Inequalities. 
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1 .  Introduction. 

Concepts  of  positive  dependence  of  sets  of  random  variables  (rv's) 
have  received  a  lot  of  attention  recently.  Their  study  was  found  to 
yield  a  better  understanding  of  the  structure  of  some  widely  used  multi¬ 
variate  distribution  functions  (df's).  In  addition  to  this,  various 
useful  Inequalities  were  obtained  with  applications  in  many  areas  of 
probability  and  statistics.  Barlow  and  Proschan  (1975),  Ch.  5,  Include 
a  review  of  most  of  the  work  done  prior  to  1972.  A  list  of  more  recent 
references  can  be  found  in  Ahmed  et  al.  (1978). 

On  the  other  hand  notions  of  negative  dependence  have  received  very 
little  attention  in  the  literature.  Some  negative  dependence  analogs  of 
positive  dependence  concepts  have  been  mentioned  by  some  authors  (Lehmann 
(1966),  Brindley  and  Thompson  (1972),  Dykstra,  Hewett  and  Thompson  (1973) 
and  Shaked  (1977)  among  others).  In  the  bivariate  setting  the  random 
vector  (T^^)  *s  usually  said  to  satisfy  some  negative  dependence 
condition  if  (T^,-^)  satisfies  the  analogous  positive  dependence 
condition.  However,  this  method  of  formulation  cannot  apply  to  higher 
dimensions.  To  the  best  of  our  knowledge  Lehmann  (1966)  in  the  bivariate 
setting  and  Mallows  (1968)  in  the  multivariate  .setting  came  the  nearest 
to  a  systematic  study  of  negative  dependence  concepts;  Mallows'  discussion, 
however,  is  restricted  to  the  multinomial  distribution.  The  development 
below  is  in  the  spirit  of  Mallows;  his  results  are  special  cases  of  ours. 

While  Che  first  draft  of  this  paper  was  being  written  two  related 
works  were  brought  to  our  attention.  The  first  work  by  Ebrahimi  and  Ohosh 
(1980)  discusses  some  negative  dependence  analogs  of  well  known  positive 
dependence  concepts.  Some  of  our  definitions  overlap  those  of  Ebrahimi  and 
Chosh  (1980);  however,  our  main  results  differ  from  theirs.  The  second 
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related  paper  Is  by  Karlin  and  Rlnott  (1980).  They  introduce  a  negative 
dependence  notion  which  is  closely  related  to  one  of  ours  and  they  obtain 
some  results  which  are  similar  to  ours.  Some  remarks  about  the  relationship 
between  these  two  works  and  the  present  paper  will  be  given  throughout  the 
text. 

The  main  motivation  for  our  definitions  is  to  try  to  formulate  the 
intuitive  requirement  that  if  a  set  of  negatively  dependent  random 
variables  is  split  into  two  subsets  in  some  manner  then  one  Rubset  will 
tend  to  be  'large'  when  the  other  subset  is  'small'  and  vice  versa.  In 
Section  2  we  define  the  conditions  to  be  discussed.  We  derive  some  in¬ 
equalities  in  Section  3  and  prove  some  preservation  properties  in  Section  4. 
Examples  are  given  in  Section  5. 

In  the  following  "increasing"  stands  for  "nondecreasing"  and  "decreasing" 

for  "nonincreasing".  Vectors  in  lRn  are  denoted  by  t  *  (t. ,...,t  )  and 

—  1  n 

t  t/  means  ^  <_  t|,  i  *  l,...,n.  Similarly  t^  <  t/  means  tj  <  tj, 
i  ■  l,...,n,  and  0  ■>  (0.....0).  A  real  function  on  ®n  will  be  called 
increasing  if  it  is  increasing  in  each  variable  when  the  other  variables 
are  held  fixed. 

A  rv  X  is  said  to  be  stochastically  smaller  than  the  rv  Y  (denoted 

by  X  Y)  if  P(X  >  x)  <  P (Y  >  x)  for  every  real  x.  The  random  vector 

X  ■  (X. . X  )  is  said  to  be  stochastically  smaller  than  Y  ■  (Y, , . . . ,Y  ) 

—  in  —  in 

(denoted  by  X  V  Y]  if  g(X)  V  g(Y)  for  every  g  e  C  where  C  is  the 
class  of  Borel  measurable  increasing  functions  on  3Rn.  It'  X  and  Y  have 
the  same  df  then  we  write  »CY.  It  is  well  known  that  X  \  Y  if  and 
only  if 

P(XeU)  £  P(YeU)  for  every  upper  Borel  set  U  in  Ttn. 


(1.1) 


[U  is  an  upper  set  if  x  e  U  and  x  5.  i  implies  that  x  e  U.]  According 
to  Kamae,  Krengel  and  O'Brien  (1977),  we  need  only  consider  open  upper 
sets  U  in  JRn.  It  is  also  well  known  that  for  every  random  vector  X^ 

(1.2)  X  +  a  ^  X  whenever  a,  £ 
and  that 

(1.3)  X  +  A  V  X  whenever  A  is  a  nonnegative  random  vector. 

Also  if  P(X  ^  0)  -  1  then 

(1.4)  aX  V  X  whenever  a  >_  1. 

See  Ar)as  and  Lehtonen  (1978)  for  an  excellent  review  on  stochastic 
ordering. 

If  X  and  Y  are  random  vectors  such  that  X  Riven  that  Y  *  y_ 
is  stochastically  smaller  (larger)  than  X  given  that  Y  -  jp' ,  whenever 
X  .1  x'  and  and  x'  are  in  the  support  of  Y,  then  we  write 

{X| Y  -  y]  (?$  X* 

More  precisely,  this  means  that  for  every  upper  set  U,  there  exists  a 
version  of  P(X^  e  u|Y)  -  $  (Y )  such  that  $(y)  is  increasing  (decreasing) 
in  y  on  the  support  of  Y. 

A  bivariate  function  K(*,*)  which  is  defined  on  x  (where 
and  S2  are  subsets  of  H)  is  said  to  be  totally  positive  of  order  2 
(TP 2)  on  x  s2  if  K(x,y)  >_  0  and  if 

K(x,y)  K(x',y')  >_K(x,y')  K(x’,y)  whenever  x  <_  x1 ,  y  <_  y' 


(1.5) 
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(see  Karlin  (1968)).  The  function  K  Is  said  to  be  reverse  regular 
of  order  2  (RR2)  on  x  s,,  if  K(x,y)  >_  0  and  If 

(1.6)  K(x,y)  K(x',y')  <_K(x,y')  K(x',y)  whenever  x£x',  y  <_y', 

(see  Karlin  (1968),  p.  12). 


2 .  Negative  dependence  concepts. 

Most,  but  not  all  of  the  positive  dependence  concepts  discussed  in  the 
literature  have  negative  dependence  analogs  that  can  be  obtained  by  changing 
the  direction  of  the  monotonicity  or  of  the  inequalities  which  define  them. 
Here  we  define  four  conditions  of  this  type  which  we  find  useful  because  we 
have  methods  of  identifying  distributions  which  satisfy  them.  Two  of  them 
are  direct  analogs  of  "conditionally  increasing  in  sequence"  and  "positive 
orthant  dependent".  The  third  has  a  positive  dependent  counterpart,  but  it 
does  not  seem  to  have  been  discussed  in  the  literature,  while  the  fourth  is 
a  variation  of  "totally  positive  of  order  two  (TPj)  in  pairs". 

In  the  case  of  positive  dependence,  one  of  the  strongest  and  most  use¬ 
ful  notions  is  that  of  TPj-ness  in  pairs;  that  is,  the  joint  density  or  the 
discrete  probability  function  f  is  assumed  to  exist  and  be  TP^  in  pairs 
(see  Barlow  and  Proschan  (1975),  p.  149).  The  natural  negative  dependence 
analog  then  is  to  assume  that  f  is  RR2  in  pairs.  There  are  several 
drawbacks  to  this  notion,  however.  Firstly,  unlike  the  situation  in  which 
the  joint  density  (or  discrete  probability  function)  is  TP2  in  pairs,  the 
marginal  densities  do  not  necessarily  enjoy  the  same  property.  A  simple 
3x2*2  discrete  example  suffices  to  show  this.  In  fact.  Theorem  5.1, 
p.  123  of  Karlin  (1968)  ia  false  when  TP2  is  replaced  by  RRj.  (It 


5 


should  also  be  remarked  here  that  even  In  the  TP^  case,  one  must  make 
some  assumptions  on  the  nature  of  the  set  {f  >  0}  before  one  can  use 
the  above  Theorem  5.1  to  conclude  that  the  marginal  densities  are  also 
TP ^  in  pairs:  see  Kemperman  (1977)).  A  possible  alternative  then  is  to 
assume  that  not  only  f,  but  all  its  marginal  densities  are  RR^  in  pairs. 
But  even  under  this  assumption  we  have  not  been  able  to  show  that  our 
weakest  condition  (see  Definition  2.3)  is  consequently  satisfied.  Ebrahlmi 
and  dhosh  (1980)  claim  to  have  proven  this  result;  however,  their  proof  is 
based  on  an  implication  which,  as  will  be  shown  below,  does  not  hold  (see 
discussion  after  Definition  2.4). 

Because  of  these  drawbacks  and  since  we  do  not  always  want  to  assume 

the  existence"  of  a  density,  we  prefer  to  work  directly  with  the  measure 

itself.  This  point  of  view  is  consistent  with  our  other  definitions  in 

that  we  do  not  make  any  assumptions  on  the  existence  of  a  density  in 

defining  them.  We  are  thus  led  to  our  first  definition. 

I.et  p  be  a  probability  measure  on  the  Borel  sets  in  IRn.  If 

I ,,...,  I  are  intervals  in  Ir\  we  define  the  set  function  p(I,  ,...,1  ) 

in  in 

by  u(I, ,...,I  )  ■  p(I  x. .  .xi  ).  By  abuse  of  notation,  we  write  p  instead 
in  In 

of  p.  If  I  and  J  are  intervals  in  we  write  I  <  J  if  x  e  I, 

y  c  J  implies  x  <  y,  that  is,  I  lies  to  the  left  of  J. 

2 

Definition  2.1.  Let  p  be  a  probability  measure  on  F  .  We  say  that  p 
is  reverse  regular  of  order  two  (RI^)  if 

(2.1)  P(I1,I2)  l«(IJ,Ip  <  p(I1,Ip  y(Ii,I2) 

for  all  intervals  1^  <  Ij,  1 2  <  I^  in  F* .  We  also  say  that  yd^.Ij) 
is  RR2  in  the  variables  Ij,I2>  If  p  is  a  probability  measure  on  Fn 


(n  >_  2),  we  say  that  y  is  RR^,  in  pairs  if  y(Ij,...,In)  is  RR2  in 
the  pairs  1^,1^  f°r  a*l  1  <  i  <  j  n  when  the  remaining  variables  are 
held  fixed.  The  random  variables  T, , . . . ,T  (or  the  random  vector  T  or 

its  distribution  function  F)  are  said  to  be  RR^  in  pairs  if  its 
corresponding  probability  measure  on  TRn  is. 


Remarks: 

(i)  The  obvious  TP^  definitions  for  y  are  obtained  by  reversing 
the  inequality  in  (2.1). 

(ii)  Clearly,  if  y  is  RR2(TP2)  in  pairs,  then  so  are  all  marginals. 
Furthermore,  it  is  not  difficult  to  show  that  if  F  is  the 
distribution  function  associated  with  y  and  if 
F(t1,...,tn)  =  (t2, »),... ,ttn,®))  is  the  survival 

function,  then  the  functions  F  and  F  are  RR2(TP2)  in  pairs 
in  the  sense  of  (1.5)  and  (1.6). 

(iii)  It  is  easy  to  show  by  a  simple  limiting  argument  that  if  y  is 
RR2(TP2)  in  pairs,  and  if  y  has  a  density  f  with  respect  to 
a  product  measure  m  ■  ot  o-finite  measures  such  that 

f  is  continuous  on  the  support  of  m  and  zero  off  the  support 
of  m,  then  f  is  RR2(TP2)  in  pairs. 

(iv)  In  the  n  *  2  case,  we  have  the  stronger  converse,  namely,  if 

y  has  a  density  f  with  respect  to  a  product  measure  m  ■  m^  x  m 
of  o-finite  measures  which  is  RR2(TP2)  on  *  S2,  where 
is  the  support  of  m^(i  ■  1,2),  then  y  is  RR2(TP2), 

(v)  In  the  TP2  case,  one  can  generalize  to  higher  dimensions  if  one 
makes  some  assumption  on  the  set  {f  >  0}.  Let  y  have  a 
density  f  with  respect  to  a  product  measure  m  ■  mjX..,xmn  of 
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o-finite  measures.  Let  be  the  support  of  m^ .  Then 

the  support  of  m  is  S  =  S*  xg  .  We  assume  that  there 

In 

exists  S  =  S  such  that  (f  >  0}f\S  =  S  and  that  f 

In  ' 

is  TP^  in  pairs  on  S.  Then  p  is  TP^  in  pairs.  Just 

use  Theorem  5.1,  page  123  of  Karlin  (1968)  repeatedly  to  show, 

e.g.,  that  for  fixed  intervals  in  1r\ 

J  n 

g(xlfx2)  -  /.../  f(x1,x2,x3,...,xn)  dm3(x3) . . .dmn(xn> 

I,  I 
3  n 

is  TP2  in  x3  and  x2  on  x  §2.  The  result  then  follows 
by  a  simple  integration  using  the  TP2  ‘  inequality  for  g. 

(vi)  The  generalization  to  the  RR2  case  is  not  as  simple.  If  one 
assumes,  however,  that  p  has  a  density  f  with  respect  to  a 
product  measure  m  «  m^x.^xm^  of  o-finite  measures  such  that 
the  density  f  when  integrated  over  any  n-2  intervals  in  JR1 
is  RR2  in  the  remaining  unintegrated  variables,  then  p  is 
RR2  in  pairs.  In  terms  of  random  variables,  this  can  be  para¬ 
phrased  as  follows.  Let  T^ . Tn  be  random  variables  with  a 

density  f  (with  respect  to  a  product  measure  of  o-finite 
measures).  Then  p  is  RR2  in  pairs  if  and  only  if  for  every 
1  <_  i  <  j  <_  n  the  conditional  density  of 


(T.,T  )|  r\  (T.  £  I.) 

1  j  k*i,j 

is  RR2  in  t^  and  tj  for  all  choices  of  intervals  I^(k^i,j) 
in  1R*.  Equivalently,  if  X^  denotes  the  indicator  function  of 
I ,  then  p  is  RR2  in  pairs  if  and  only  if 


J.../  [  n 

1R  JR  k*i,j 


Xt  (tk»]  f(tl 
k 
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is  RR^  in  the  unintegrated  variables  t^  and  t^  for  all 

choices  of  intervals  I,  (k^i,j)  in  1R^.  By  replacing  XT 

Tk 

by  <(>^  in  the  above  integral,  and  requiring  it  to  be  RR2  in 
t^  and  tj  whenever  is  a  set  of  PF^  functions, 

one  obtains  the  negative  dependence  condition  of  Karlin  and 
Rinott  (1980).  It  is  a  condition  which  is  stronger  than  the 
one  of  Definition  2.1  as  can  be  easily  seen  by  recalling  that 
the  indicator  function  of  an  interval  is  PFj. 

(vii)  Clearly,  if  {y^}  is  a  sequence  of  RR£  in  pairs  probability 
measures  and  if  pn  converges  weakly  to  p  then  p  is  RR^ 
in  pairs. 


Definition  2.2.  The  rv's  T^,...,Tn  (or  the  random  vector  T  or  its 
df)  are  said  to  be  conditionally  decreasing  in  sequence  (CDS)  if,  for 
i  =  1,2, .. . ,n-l , 

(2.2)  f  T  i_|_i  I T 1  **  t^ , . . .  ,Tj  *•  t^J  ♦  (tj,...,t^). 

Definition  2.3.  The  rv's  T . T  (or  the  random  vector  T  or  its 

-  I  n  — 

df)  are  said  to  be  negatively  upper  orthant  dependent  (NUOD)  if  for  every  £, 

(2.3)  P (T  >  t)  <  "  P (T .  >  t<). 

—  —  —  i-l  i  1 

They  are  said  to  be  negatively  lower  orthant  dependent  (NLOD)  if  for  every  t^, 

P(T  <  t)  <  r  P (T .  <  t . ) . 

- -  i-l  i  -  i 


(2.3') 
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When  n  *  2,  (2.3)  and  (2. 3')  are  equivalent,  but  not  when  n  >_  3 
(see,  for  example,  Ebrahimi  and  Ghosh  (1980)). 

The  next  concept  has  a  natural  positive  dependence  analog,  however, 
we  are  not  aware  of  any  place  in  the  literature  in  which  it  has  been 
discussed . 


Definition  2.4.  The  rv's  T^,...,Tn  (or  the  random  vector  T.  or  its 
df)  are  said  to  be  negatively  dependent  in  sequence  (NDS)  if,  for 
i  =  2,3, ... ,n, 

(2. A)  [(Tj, . . . ,T±_1) |T±  -  t±]  Bf  t±. 

Often,  to  verify  (2.4),  one  can  find  it  easier  to  verify  that  for 
i  ~  1 , . . . ,n , 


(2.4') 


l  (T^  ,  .  >  •  »  •  •  •  I 


ti1 


i- 


Then  clearly  (2.4)  holds. 

We  now  investigate  some  of  the  relationships  among  the  various 
definitions.  First  note  that  NDS  implies  both  NUOD  and  NLOD  and  these 
implications  are  sharp.  To  see  this,  use  methods  similar  to  Barlow  and 
Proschan  (1975)  to  show  that  (2.4)  implies  for  1  ■  l,...,n-l. 


(2.5) 


[(t1,...,t1)|t1+1  >  ti+1 


]  ?  [(Tx . T^)  |T 


i+1  >  ti+l1 


whenever  (Although  Barlow  and  Proschan  assumed  the  existence 

of  a  density,  a  modification  of  their  proof  works.)  But  from  (2.5)  it 


follows  that 
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P  (T.  >  t 


1*' 


•*T 


Cn> 


i  P(Tj 


.,T 


n-1 


t  P(Tn 

n- 1  n 


i  P (Tj  >  tj 


••V2  »  ‘n-2> 


!  P(T  »  t,) 
i-n-1 


i  ...  i  "  P(T.  >  t.) 
1-1  1  1 


which  proves  (2.3).  The  proof  of  (2.4) (2.3')  Is  similar. 

It  is  not  difficult  to  construct  an  example  which  shows  that 
CDS  NUOD  and  that  CDS  ^  NLOD  and  hence  CDS^NDS.  For  example,  let 
P(T1-1,T2-1)  -  P(T1-l,T2-2)  -  P(T1-2,T2-2)  -  .1  and  P(T  -2,T2-1)  -  .7 
and  let  given  -  1,  T2  -  1  be  degenerate  at  11,  given 

Tj  -  1,  T2  -=  2  be  degenerate  at  1,  T3  given  Tj  -  2,  T2  “  1  be 
degenerate  at  10  and  T3  given  -  2 ,  T2  -  2  be  degenerate  at  1. 

Then,  clearly,  (T^T^T^  is  CDS  but  P(T1  >  1,  Tj  >  1)  -  .7  >  .64 
=  P(T]  >  1)  P(T3  >  1),  thus  (T1,T2,T3)  is  neither  NUOD  nor  NLOD . 

Ebrahimi  and  Ghosh  (1980)  claim  that  CDS=>NU0D;  the  example  shows  that 
this  is  not  the  case. 

Next  we  show  that  NDSjt^CDS.  Let  (T^,T2>T3)  take  on  values  on 
the  eight  vertices  of  the  unit  cube  such  that 

P (Tj=l ,  T2=0,  T^-l)  -  P(T^-1,  T2-l,  Tj-0)  -  .2  and  the  other  six  pro¬ 
babilities  are  .1.  It  is  easy  to  see  that  and  T2  are  independent 

and  that  [(T^)^  -  0]  V  [(T^T^Tj  -  lj.  Thus,  (T^.Tj)  is 
NDS .  But  P(T3  >  0 | Tx  -  0,  T2  -  0)  -  1/2  <  2/3  -  P(T3  >  0|T1  -  1 ,  T?  -  0), 
hence  (T^.T^T^  is  not  CDS. 

We  will  shortly  show  that  under  some  reasonable  assumption,  RR2:=£CDS. 
It  is  not  known  whether  RR2^  NDS,  but  it  does  imply  NUOD  and  NLOD.  This 
is  clear  since  from  the  remark  (ii)  following  Definition  2.1,  we  have  that 
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PPM"  , J  . 


both  K  and  F  are  RR2  in  pairs.  The  result  then  follows  by  a 
simple  argument. 

The  implications  RR2=£CDS,  RR2=?NUOD,  RR^  NLOD,  NDSr^NUOD 
and  NDS^NLOD  justify  the  consideration  of  the  RR2  and  the  NDS 
concepts.  In  Sections  4  and  5  it  will  be  shown  that  many  df's  are  RR2 
or  NDS  and  thus  these  df's  satisfy  the  meaningful  CDS  concept  and  the 
inequalities  which  can  be  derived  from  the  NUOD  and  NLOD  concepts.  In 
addition  to  it  the  NDS  concept  is  intuitively  meaningful  by  itself.  We 
mention,  in  passing,  that  since  Karlin  and  Rinott's  (1980)  condition 
implies  the  RR2  condition  it  follows  that  it  implies  the  NUOD  and  the 
NLOD  inequalities  (2.3)  and  (2.3').  Actually,  Karlin  and  Rinott  (1980) 
obtained  some  additional  useful  inequalities  which  follow  from  their 
stronger  condition. 

To  justify  calling  (2.1)  —  (2.5)  "conditions  for  negative  dependence" 
we  have  to  show  that  they  imply 

(2.6)  COVOI^.Tj)  <0,  1U<  j<n, 

when  the  second  moments  exist. 

From  (2.3),  it  follows  that  P(T^  >  t^,  T^  >  t  ^  )  ^P(T^  >  t^)  P(Tj  >  tj) 
and  it  is  well  known  that  this  inequality  implies  (2.6)  [see  e.g.,  Lehmann 
(1966)].  Similarly  (2.3')  implies  (2.6)  and  hence  also  (2.4)  and  (2.5) 
imply  (2,6). 

If  (T^,...,Tn)  is  CDS  then  [T2 1 T^  ■  t^]  a&  t^{  hence  cov(Tj,T2)  0. 
Thus,  if  (Tff  (i ) , . . .  ,T^  (n) )  is  CDS  for  every  permutation  tt  of  {l,2,...,n} 
then  (2.6)  holds. 

Vie  close  this  section  with  a  proof  that  RR2  in  pairs  implies  CDS  under 
some  reasonable  assumptions.  Let  p  be  RR2  in  pairs  and  let  (T^,...,T^) 


It  nnv  random  vector  having  11  as  its  induced  probability  measure. 


I'ix  t  l  IK  and  an  integer  i,  1  <_  i  n-1 .  Define  the  set  functions 
v  and  A  on  the  Borel  subsets  of  1R*  by 

v(A)  -  P(T  x>  t1+1;  (Tx . Tt)  e  A) 


A  (A)  -  P((T  . Tj)  e  A). 

£■  °° 

Note  that  A  is  just  a  marginal  of  y.  Then  if  is  a  sequence 

of  rectangles  which  partition  TR*  for  each  i  and  whose  mesh  size  tends 

to  zero  as  l  -*■  and  if  (R,,  }  is  a  refinement  of  (R,,  }  for  all  i, 

ik  ik 

then  it  is  well  known  by  martingale  arguments  (see,  e.g.,  Meyer  (1966)) 

that  on  a  set  I)  with  A(DC)  *  0,  v^®ik^^Rik^  *  ^  ^  l  *  *  *  *  * c  38  Rik 

decreases  to  (t^ . t^)  pointwise  and  in  L^;  moreover  <J> (t ^  .  ,t^) 

is  a  version  of  P<T1+1  >  ti+ilTi  *  *  t^).  We  may  assume  without 

loss  of  generality  that  D  C  support  A.  We  want  to  show  that  <p  is 
decreasing  on  D.  This  will  follow  if  we  can  assume  that  the  support 
A  of  A  satisfies  a  chain  condition;  that  is,  if  t  e  A  and  t_'  e  A  with 


t  <  t’,  then  there  exist  t,  e  A  such  that  t“t_<t.  <...<t  “  t '  and 
—  —  —  — j  —  —i)  — X  —  —  *“ m  — 

t,  differs  from  t...  in  only  one  component.  To  see  this,  suppose  that 
“1  “1+1 

t^  and  _t  differ  in  only  the  first  component.  Then  let 

h  ‘  ’!•  h+i  -  <-•'!«!  *  and  h . h  be 

intervals  such  that  t^  e  1^  *  Ij*****!^  — 1  E  ^l  x  l2X,'*x*i‘  p 

is  the  marginal  of  (^ . T  j) ,  then  since  it  is  RR2  in  pairs,  we  have 
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or  that 

v(I  ^  »^2  ’  *  *  *  ^^1  *^"2  * ' ' '  '^1  ^ 

a(i1,i2,...  ,i1)  -  a(i',i2,...,i1)  * 

Note  that  there  is  no  trouble  dividing  since  the  denominators  are  non  zero. 

Iterating  this  procedure  to  pass  from  t.  to  t_,  t-  to  t_,...,t  .  to 

t  and  then  letting  the  intervals  shrink  we  find  that  f(t)  >  $(t').  Now 

—  —  — 

set 

Cf(t)  If  t,  E  D 

inf  { 4>  C^_) ;  £  e  D,  s_  <_  _t }  (inf  0  -  +»)  if  _t  ^  D. 


It  easily  follows  that  4>*  is  decreasing  everywhere  and  is  a  version  of 

p(Ti+l  >  ci+1|Ti  “  t1»****Ti  =  ti)’ 


The  chain  condition  is  easily  seen  to  be  satisfied  if  the  support 
of  u  is  a  cross  product.  Without  some  type  of  chain  condition,  we  can 
only  show  that  <j>  is  decreasing  componentwise  on  D.  In  this  case,  one 
may  not  be  able  to  extend  $  such  that  it  is  decreasing  everywhere  and  is 
still  a  version  of  the  condition  probability. 


3.  Some  inequalities. 

This  section  is  devoted  to  the  derivation  of  some  inequalities  which 
may  be  of  special  interest.  The  results  closely  parallel  those  found  in 
Karlin  and  Rinott  (1980),  but  are  derived  under  weaker  assumptions. 

Proposition  3.1.  (^ . T^)  is  NUOD  if  and  only  if 

E {  "  MT<>]  1  "  .(T  )] 

i-1  11  i-1 


(3.1) 
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wlienever  all  $  are  nonnegative  and  Increasing.  The  result  (3.1) 
is  also  true  if  we  replace  NUOD  by  NLOD  and  increasing  by  decreasing. 

Proof.  Let  <^J  (t)  -  X/t  v(t),  i  -  l,...,n.  Then  (3.1)  reduces  to 

-  i  (bjt”) 

the  NUOD  inequality  (2.3).  Since  each  side  of  (3.1)  is  multilinear  in 
the  the  result  holds  for  nonnegative  linear  combinations  of  such 

indicator  functions  and  hence  for  the  general  4> ^  by  a  standard  limiting 
argument.  |  | 

Now  suppose  that  p  is  RR2  in  pairs.  For  every  i,  let  1^  ■  K^, 

all  intervals,  with  <  K^. 

Theorem  3.1.  If  1  k  <_  n ,  then 

(3-2>  "<Ji . V  . V  . 4'Vi . V 

x  U<I1 . Ik,Jk+l . V- 

The  result  (3.2)  is  also  true  if  ve  replace  all  J's  by  K's. 

Proof .  We  proceed  by  induction.  If  n  ■  2,  then 

ruU^I,)  »(I,,J,)  if  k-1. 
u(JltJ2)  v(lvl2)  1  )  1 

|_p(JltJ2)  P(I1,I2)  if  k  -  2. 

The  case  k  ■  1  follows  from  the  RR£  assumption  since 

u(J1,J2)  (j1,j2v/k2) 

I 

(j^Kj^.Jj)  (jjyl^.J^Kj) 


0  > 


|(J1,J2)  uU^Kj) 


uOc^)  pO^.Kj) 


and  the  case  k  =  2  is  an  identity. 


Now  suppose  that  (3.2)  is  true  whenever  v  is  a  probability  measure 

on  VRn  which  is  RR^  in  pairs.  Let  p  be  a  probability  measure  bn  lRn+1 

which  is  RR^  i*i  pairs  and  let  1  <_  k  <_  n+1.  Since  there  is  nothing  to 

prove  if  k  =  n  +  1,  we  may  assume  that  1  <_  k  <_  n.  Similary,  we  may 

assume  that  ii(J. . J  ,J  . , )  i  0.  It  suffices  then  to  prove  that 

I  n  n+i 


(3>3)  -(Ii . '.•Vi1 _  »ai . Wi1 _ 

"(J1 . Jk,Ilctl . ’ic’ntd  “  P<J1 . Jk’Ik+l . In’Jn+l) 

since  by  the  induction  hypothesis,  we  have  that 

ti(Il . In,Jn+l) _  <  W(I1 . Tk,Jk+l . Jn,Jn-H) 

p(J1,...,Jk,ik+1,...,rn,Jn+1)  -  p(Jlt...,Jn,Jn+1) 

But, 

u(Il,”,,In,In+l)  p(J1,I2,...,In,In+1) 

”(Ii . V-W  -  w(J1,I2 . VW  -**•' 

P*J1 . Jk,Ik-fl,,,,,In,In+l) 

P  (JL, . . . ,Jk,Ik+1, . . . ,In,Jn+1) 

which  is  another  way  of  writing  (3.3).  The  jth  inequality  above  follows 
from  the  fact  that  p  is  RR2  in  the  pair  j  and  (n+1).  || 

Suppose  that  (T  ,...,Tn)  is  RR2  in  pairs. 

Corollary  1.  If  a,  6  partition  {l,...,n),  then 
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(3.4)  Pfl^eJ  .ieo^B)  P (T^el ^ , ica^  8) 

<  P(TieJi,iea;TjeIj,jc6)  P(T±cIi  ,ie«  ;T  cJ^ ,  je8) . 

It  also  holds  true  If  we  replace  all  J's  with  the  K's. 


Corollary  2.  If  a,  8,  y  partition  {l,...,n},  then 

(3.5)  P(T1eLi,iea;TjeJj,jeBuY)  P(T1ELi,iea;TjeI.j  ,jc8^y) 

1  P(TieLi,iea;TjEJj ,jeB;TkeIk,kEY)  P (T^eL^ ,iea jT^elj ,  jeB;TkeJk,kEY) 

for  any  intervals  L^,  i  e  a.  It  also  holds  true  if  we  replace  all  J's 
with  the  K's. 

If  we  take  °  (-“fb.^]  and  1^  *  for  i  £  au  8  in 

Corollary  1,  we  get  (3.5)  of  Karlin  and  Rinott  (1980).  If  we  take,  in 
Corollary  2,  =  [a^.b^],  i  e  a,  ■  (-“.b^]  and  1^  -  (-®,®)  for 

1  e  Bvj  y  we  get  (1.7)  of  Karlin  and  Rinott  (1980). 

Also,  note  that  as  soon  as  we  have  an  inequality  of  the  form 


<3-6)  P(T1  i  bl . T„  i  b„>  i  P(T1  i  bl . V  P(Vl  iVl . T„  i  bn> 

it  follows  as  in  the  proof  of  Proposition  3.1  that 


(3.7) 


E[  tt  $  (T.  )]  <  E[  ir  <f. .  (T.  )  ]  E[  tt  d>  (T.)  ] 
i-1  1  1  ”  i-1  1  1  j-k+i  i  J 


whenever  are  nonnegative  and  decreasing.  Similarly,  if  we  have 


P(T1  »  al . T„  »  *„>  2.P<T1  *  *1 . Tk  >  *k>  p<Tk«  *  “k+1 . T„  *  *n 


(3.8) 


then  (3.7)  holds  for  4>  nonnegative  and  increasing. 

In  particular,  (3.6)  holds  If  F  Is  RR,,  In  pairs  and  (3.8)  holds 
if  F  is  RR£  in  pairs.  In  fact,  if  F(or  F)  is  RR,,  in  pairs,  then 
it  is  MRR^  in  the  sense  of  Karlin  and  Rinott  (1980);  i.e., 

F(xAl)  F(WjO  <F(x)  F(Z). 


4 .  Closure  Results. 

Preservation  theorems  are  useful  for  identifying  negatively  dependent 
df’s  or  for  constructing  new  negatively  dependent  df's  from  known  ones. 

In  this  section  we  discuss  some  preservation  results  and  describe  a  method 
for  the  construction  of  negatively  dependent  df's. 

Theorem  4.1.  If  T, , . . .  ,T  are  (*)  and  if  iji,  , . . .  ,\p  are  strictly 

in  in 

increasing  functions  then  (T, )  , . . .  ,i|>  (T  )  are  (*)  where  (*)  is 

11  n  n 

one  of  the  following:  RR2  in  pairs,  CDS,  NDS,  NUOD  or  NLOD. 

Theorem  4.2.  If  (T^,...,Tn)  and  (S^ . S^)  are  independent  and  are  (*) 

then  (T^ . T^ ,  S  ,...,S  )  is  (*)  where  (*)  is  the  same  as  in  Theorem  4.1 

The  proofs  of  these  theorems  are  straightforward  and  will  be  omitted. 
The  following  preliminaries  are  needed  for  the  statement  of  Theorem  4.3 

A  univariate  density  f  is  said  to  be  a  Polya  frequency  function  of 
order  2  (PF2)  if  f(x-y)  is  TP2  on  H  *  ®.  A  probability  function  f 

is  PF2  if  f(x-y)  is  TP2  on  f!  x  IN  where  ti  ■  {...,-1,0,1,...}.  A 


thorough  discussion  of  PFj  densities  and  many  examples  can  be  found  in 
Karlin  (1968). 
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It  will  bo  shown  In  a  later  section  that  many  multivariate  random 
v.iri.illes  satisfy  a  certain  structural  condition.  In  the  next  theorem 
we  will  show  that  this  condition  implies  some  of  the  dependence  conditions 
that  we  have  introduced. 


Theorem  4.3.  Let  Sg,S^,...,Sn  be  independent  rv's  and  assume  that 

each  has  a  PF^  density  (or  probability  function).  Fix  s  and  let 

(T,  , . .  .  ,T  )  have  the  same  joint  df  as  the  conditional  df  of  (S,,...,S  ) 
L  n  in 

given  that  S.  +  S,+...+S  “  s,  that  is, 

()  1  n 


(4.1) 


(T1,...,Tn)  S-  [(S1,...,Sn)|SQ  +  S1+...+Sn  -  ■]. 


Then  (Tj , . . . ,T^)  is  RR2  in  pairs  and  consequently  CDS,  NUOD  and 


NLOD . 


Proof.  Let  y  be  the  probability  measure  of  (T, . T  )  on  lRn.  Then 

-  i  n 

by  assumption  p(I,,...,I  )  «  P(S,  e  I,,...,S  e  I  I S_  +  S-+...+S  ■  s) . 

L  n  ii  nnui  n 

Now  the  loint  density  of  [ (S, ,...,S  ) I Srt  +  S-+...+S  ■  s]  is  given  by 

i  n  u  i  n 

n 

c  IT  f  (s  )f  (s  -  S--...-S  ) 
i=l  i  i  0  1  n 

where  c  is  a  normalizing  constant.  We  first  show  that  p  is  RR2  in 
the  variables  Ij,  I2  when  the  remaining  intervals  I^,...,In  are  held 
fixed.  According  to  the  remark  (vi)  following  Definition  2.1,  we  need 
only  show  that 

g(s1,s2)  -  c  f2^s2^  f0^S_Sl",82^ 


is  RR2  in  s^  and  s2»  where 
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•  '('.)  =  /•••/  f.(s  )...f  (s  )  f  (f-s  )  dm(s  )...dm(s  ) 

<>  3  3  n  n  0  3  n  3  n 


r,  T 

)  n 


and  m  is  either  the  Lebesgue  measure  or  the  counting  measure.  However, 

t lie  above  is  nothing  but  the  convolution  of  the  PF^  functions 

f,  X.  . f  X.  and  f~,  where  X  is  the  indicator  function  of  the 

j  I  t  n  I  U  A 

3  n 

set  A,  and  so  f^  is  eas*-ly  follows  then  that  g  is  RR2* 

The  proof  that  u  is  RR2  in  the  variables  1^,  I  for  all  1  <_  i  <  j  <_  n 
is  similar .  I | 


Remark.  Actually,  it  is  not  difficult  to  show  that  under  the  assumptions 

of  Theorem  4.3  the  df  of  the  random  vector  (T,  , . . .  ,T  )  satisfies  the 

1  n 

stronger  condition  of  Karlin  and  Rinott  (1980).  The  consequences  of  this 
ole  ervation  are  discussed  below. 

It  is  well  known  (see  Section  5)  that  the  multinomial,  the  multivariate 
hypergeometric,  the  Oirichlet  and  the  Dirichlet  compound  multinomial  random 
vectors  as  well  as  some  multivariate  normal  random  vectors  with  nonpositive 
correlations,  can  be  represented  as  in  (4.1).  Thus,  all  the  respective  df's 
are  RR2  in  pairs.  Karlin  and  Rinott  (1980)  have  shown  that  these  df's 
actually  satisfy  their  stronger  condition;  however,  their  proofs  are  quite 
involved  and  differ  from  one  case  to  another.  By  the  Remark  after  Theorem  4.3 
these  results  of  Karlin  and  Rinott  (1980)  follow  at  once. 

The  following  theorem  will  be  found  useful  in  the  next  section. 


Theorem  4.4.  Assume  that  (T^,...,Tn)  and  (S^,...,Sn)  are  independent 
and  NOS.  If  all  the  univariate  marginal  densities  (with  respect  to  Lebesgue 
measure)  or  probability  functions  in  the  discrete  case  of  and  T  are 
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PF„ ,  then  (T,  +  S, , . . . ,T  +  S  )  is  NDS. 
2  1  I  n  n 


Remark .  Karlin  and  Rinott  (1980)  have  proven  a  similar  result.  They 


assumed  that  IS  and  T  satisfy  their  RR2  condition  and  that  they 
have  PF2  marginals,  and  they  showed  that  then  £  +  T  satisfy  some 


inequalities  that  are  essentially  variants  of  the  NUOD  and  the  NLOD 


inequalities. 


For  the  proof  of  Theorem  4.4  we  need  the  following  lemmas. 


Lemma  4.1.  Let  X  and  Y  be  independent  rv's  and  assume  that  Y  has 


a  PF^  density  (probability  function).  Then 


[X|X  +  Y  -  z]  V  z. 


This  lemma  is  actually  Example  12  of  Lehmann  (1966). 


Lemma  4.2.  Let  X  and  Y  be  independent  and  assume  that  X  and  Y 


have  PF^  densities  (probability  functions).  Then 


(4.2) 


[ (X ,Y) | X  +  Y  -  z]  z. 


Proof.  Let  z-^  <  zn.  Denote  by  V.y)  the  upper  set  {(s,t):  s  >  x,  t  >  y). 


First  it  will  be  shown  that  for  every  (x,y) 


,3)  P{  (X  ,Y)  e  U(x  y)|x  +  Y  ■  z^}  <P{(X,Y)  e  U(x,y)lX  +  Y  "  z2h 


Let  -  { (s,t) :  s  +  t  -  zi,  x  <  s  <  zt  -  y}, 

B  -  {(s,t):  s  +  t  -  zit  s  >  z±  -  y),  C±  -  {(spt):  s  +  t  -  z±t  s  <  x). 


I 


I 


i  =  1,2 


I  s  c 

(see  Figure  1).  From  Lemma  4.1  [X|X  +  Y  =  z]  t  z;  hence 


P{  (X,Y)  e  Cjx  +  Y  =  z2>  =  P{X  <_  x|x  +  Y  =  z  } 


>.  P{X  <  x|x  +  Y  -  z2J  -  P{ (X,Y)  e  C2|X  +  Y  »  z2). 

Similarly 

P{  (X  ,Y)  e  B^X  +  Y  -  z^  >  P{(X,Y)  e  B^X  +  Y  -  z2> . 

Thus 

P{ (X,Y)  £  Aj|X  +  Y  -  2l>  >  P((X,Y)  E  A2 jX  +  Y  -  z2) 
which  is  (4.3). 

t 


Figure  1. 

m 

Now  consider  upper  sets  of  the  form  U  *  W  U,  \  *  called  funda- 

i«l 

mental  upper  domains  in  Block  and  Savits  (1979).  Without  loss  of  generality 
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assume  that  x^  <  <_  . . .  and  y^  >_  > 


>  y  .  Define 
—  m 


=  min  { i :  x^  +  y  £_  z^} 
j ^  *  min  {i  >  1^:  x1  +  yi  >  z^s 

and,  by  induction 

(A. 4)  iR+1  -  min  {i  >  jfc:  xt  +  yt  <_  z^) 


(4.5) 


k+1 


min  { i  >  i 


k+1' 


Xi  +  yi  >  Z2} 


k  =  1,2,... ,n  where  n  <  “  is  the  largest  k  such  that  the  set  on  the 
right  hand  side  (RHS)  of  (4.4)  or  (4.5)  is  not  empty.  If  there  are  n  i's 
but  only  n-1  j's  define  “  m  +  1  (see  Figure  2  in  which  m  ■  12, 
n  »  3,  i1  «  3,  jx  -  6,  i2  -  8,  J2  -  11,  i3  -  12,  j3  -  13). 

n 


from 


Let  U  «  U  ..In  the  following  the  first  inequality  follows 

k-i  (x4  >y *  ' 

*k  3k-l 

UC  U  and  the  second  one  from  (4.3): 


P{  (X ,Y)  c  Ujx  +  Y  -  z^  <  P{  (X ,Y)  e  UjX  +  Y  -  z^ 


n 

-  I  P{  (X ,Y)  e  U  Jx  +  Y  -  z  } 

k-i  'll-1 

(4.6) 

<  l  P{  (X ,Y)  e  U  JX  +  Y  -  z2) 

k-i  W1 


P{  (X ,Y)  e  Ujx  +  Y  -  z 2 )  -  P{(X,Y)  e  ujx  +  Y  -  z2). 


Si  net*  open  upper  sets  can  be  approximated  by  fundamental  upper  domains  (Block 
and  Savits  (1979))  it  follows  from  (4.6)  that  for  every  open  upper  set  1' 


P{  (X,Y)  c  U|X  +  Y  -  Zl>  <_  P{  (X,Y)  c  U|X  +  Y  -  z2) 
and  the  proof  of  the  lemma  is  complete  by  (1.1).  || 

Lemma  4.3.  Let  X  *  (X,  ,  ...,X_)  and  Y  =  (Y, . Y  )  be  independent 

- - —  —  1  m  —  l  m 

and  assume 


(4.7) 


t (X. «... ,X  )|x  -  x  )  V  X 

x  m-x  m  in  i 


and 


(4.8) 


[(Y  ...,Y  )JY  -  y  ]  V  y 

i  m— l  mm  m 


furthermore ,  assume  that  X  and  Y  have  PF_  densities  (probability 

mm  l 

functions).  Then 


[  (X  +  Y  . X  +  Y  )  |  X  +  Y  -  z  ]  z, 

i  x  m— x  m-x.  m  m  m  i 


Proof .  Dearly,  for  any  increasing  function  g, 

L(g(X  +  Y ...... X  +  Y  )|X  +  Y  -  z  ]  -  F[«KX  ,Y  )|X  +  Y  -  z  ] 

I  I  m— x  m-l  m  m  m  m  m  1  m  m  m 

where  rp  (x  ,y  )  «=  Lf  g(X,  +  Y,  , . . .  ,X  .  +  Y  . )  |  X  •*  x  ,  Y  -  y  ] .  However , 
m  m  1  1  m-x  m—x  m  mm  m 

ii(x  ,y  )  decreases  in  x  and  in  y  because  of  (4.7),  (4.8)  and  in- 
m  m  m  m 

dependence.  Thus,  by  Lemma  4.2,  E[0(X  ,Y  )|X  +  Y  *  z  1  decreases  in 

m  m  »  m  m  m 


7.  . 

m 


Proof  of  Theorem  4.4.  Let  i  e  {l,...,n-l}.  Substitute  m  -  1  +  1  in 


7S 


lemma  4.  )  to  obtain 


1C1',  +  s, 


•  ,T.  + 


V 


i+1 


+  s 


i+1 


Zi+1! 


V 


i+1  ’ 


that  is,  T  +  S  is  NDS .  j  j 


S .  Examples . 


1 . 1  .  Tiie  multinomial  df. 

bet  (Tj,.,.,T  )  have  the  loint  probability  function  with  parameters 

(^iPj i * • • » )  j 


P(T1  "  V 


,T  = 


Cn>  * 


n: 


tj.! - tn!(N-  J  tl)I 


n 

n - (ilip«  > 


i-1 
n 


N-  l  t. 

S  H-i  1  n 

*  (1_  l  P4)  ,  t  >  0,  l  t  <  !M, 

i-1  i-1  1 


where  p  >  0  (i  *  l,...,n)  and  0  <  £  p  <  1 . 

1  1-1  1 

The  multinomial  df  is  the  conditional  df  of  Independent  Poisson 

rv's  Riven  their  sum.  Thus,  by  Theorem  4.3  the  multinomial  df  is  RR^ 

in  pairs  and  hence  it  is  also  CDS,  NUOD  and  NLOD .  By  Remark  (iii)  the 

joint  probability  function  of  (T  , . . . ,T  )  is  RR„  in  pairs.  Ry  the 

in  L 

discussion  after  Theorem  4.3  the  multinomial  df  satisfies  the  RR^ 

condition  of  Karlin  and  Rinott  (1980). 

To  show  that  (T,,...,T  )  is  NDS  it  is  enough  to  show  that  for 
1  n 

0  <  t  <  N-l , 

-  n  — 


(5.1.1) 


[<T, 


T  ) 
’  n-l' 


? 


f  <T. 


r  .) |t 

n-l  '  n 


t+1  ] 
n 


I'.t.uisp  then,  by  symmetry  one  has  (2.41).  The  left  hand  side  (l.HS)  of 

(5.1.1)  has  an  (n-1 )-dimensional  multinomial  df  with  parameters 

(N-i  n  ,»|  i  , .  .  .  ,q  j  )  and  the  right  hand  side  (RHS)  of  (5.1.1)  has  the 

same  df  with  parameters  (N-t  -1,  q,,...,q  .)  where  q.  ■  p./(l-p  ). 

n  i  n-i  11  n 


Thus  % 

(5.1.2)  LHS  (5.1.1)  S-  RHS  (5.1.1)  +  (S,,...,S  .) 

1  n-l 

wliere  (Sj,...,S  ^)  has  a  multinomial  df  with  parameters  (l,qj,. 

and  the  sum  on  the  RHS  of  (5.1.2)  is  of  independent  random  vectors. 
Inequality  (5.1.1)  follows  now  from  (5.1.2)  and  (1.3). 

l.et  (X ^  , .  .  .  ,X  )  he  th?  sum  of  Independent  n-dimensional ,  not 
necessarily  identically  distributed,  multinomial  random  vectors,  that 
let  (X^,...,Xn>  have  the  probability  generating  function 


•  q 


n 


is 


m 

TT 

4-1 


(p, „u.+. . .+p  „u  ) 
K14  1  1 n4  n 


n 


where  p^  ^  0,  (i  =  1 

,...,n) 

,  0  <  l  Pit  <  1 

i-1 

and  >_  1  (4*1,2,. 

The  univariate  marginal 

df 's 

of  a  multinomial 

df  is  a  binomial  df . 

Since  the  binomial  df 

has  a 

PF£  probability 

function  it  follows  from 

Theorem  4.4  that  (X^  , . 

•••V 

is  NDS.  Hence 

(X^,...,Xn)  is  also  NUOD 

and  NI.OI),  that  is, 


and 


p(xL  >  x1,...,xn  >  xn)  <  P(xi  >  xt) 

p<xi  —  xi . i*„>  i,:,  p«i  i«i>- 


These  inequalities  are  stated  unproven  in  Mallows  (1968).  Compare  also 
l.ehmnnn  (1966),  pp.  1  143,  1144  and  1151. 
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• .  .  Multivariate  normal. 

l  ot  T  =  (T,  ....  ,T  )  lip  ;;  multivariate  svnmotrir  normal  ramlom 
1  n 

vet  tor  with  Corr(1\,T  )  =  p  <_  0,  1  <_  •»  <  i  *  n .  Then  i>  >  -  (n-1)  '  . 

U;e  will  show  that  T  is  RR2  in  pairs. 

Using  Theorem  4.1  assume,  without  loss  of  generality,  that  FT ^  =  0 

anti  Var(T  )  =  1,  i  =  l,...,n.  I.et  Y,,...,Y  he  independent  identically 
1  in 

distributed  normal  rv's  such  that  EY^  =  0  and  Var(Y^)  «  1-p  (i  -  1 . n) 

and  let  he  an  independent  normal  rv  with  F.Y^  -  0  and 

Var(Y0)  =  (-p)"T(l-p)(l+(n-l)p).  Then 


<Ti--V  -  [(Y1 


,Yn)|Y0+  Y]+...+Yn 


0J. 


Since  any  normal  density  is  PF2  it  follows,  from  Theorem  4.3,  that  T 

is  l\R,(  in  pairs. 

In  fact  we  can  obtain  a  stronger  result.  If  the  correlation  matrix 
of  T  is  of  the  form 


(5.?.  1) 


n  \ 

^(rrl)1/2\ 

V 

rni 

Uv1>1/2I 

,,  /2  ,,1/2, 

( (r ^  ~3  )  « • • •  »  1 )  ) 


X.’lhTi* 


pa  i  rs . 


I  , . . .  ,  n  a nd 


n 


l 

i-1 


>  n-1 


then  T  is  RR^  in 


To  show  it  note  that  every  matrix  of  the  form  (5.2.1)  can  he  the 

correlation  matrix  of  a  multinomial  random  vector,  X_  =  (X^,.,.,X  ),  sav. 

(?) 

let  ,  l  -  1,2,...,  be  a  sequence  of  independent  random  vectors 

(  N  ^  /  *  v 

distributed  as  X_.  Clearly  Y/  '  *  J  It  is  a  multinomial  random 

t-1 

vector  with  correlation  matrix  (5,2.1).  Normalizing  Y 


such  that  it 


?R 


it. is  zero  moans  and  unit  variances,  it  converges  in  d istri hut  ion ,  hv  the 
'  hi  I t i variate  central  limit  theorem,  to  a  multivariate  norma]  random 
vector  with  correlation  matrix  (5.2.1).  By  Remark  (vii)  the  limit  in 
distribution  of  UR^  in  pairs  random  vectors  is  RR^  in  pairs.  The 
assertion  in  the  preceeding  paragraph  now  follows.  The  previous  result 
that  deals  with  the  symmetric  multivariate  normal  df  with  negative 
correlations  p  is  obtained  hy  taking  r  -  1  -  p  in  (5.2.1). 

Every  multivariate  normal  random  vector  T  with  nonpositive 
correlations  is  NDS.  It  is  easy  to  verify  (2.4')  directly  using  (1.2). 
Thus  T  satisfies  (2.3)  and  (2.3'). 


5.3.  Multivariate  hypergeometric. 

let  (Tj,...,T  )  have  the  probability  function 

/m-  y  m,\ 

■i 
:i 


l’(T  =  t1,...,T'n  -  tn)  -  m 


n 

7T 

i=l  t 


ihn* 

1  hi 

\  i-1  7 


»  t1  i  °»  l  t  I  N, 

i-1 


with  positive  integer-valued  parameter  vector  (N,  M  , . . .  ,M  ,  M)  (see 

1  n 

Johnson  and  Kotz  (1969]. 

The  multivariate  hypergeometric  df  is  the  conditional  df  of 

independent  binomial  rv's  given  their  sum.  Thus,  by  Theorem  4.3  the 

hypergeometric  df  is  RR£  in  pairs  and  hence  it  is  also  EDS,  NUOD 

and  NL01).  By  Remark  (Hi)  the  loint  probability  function  of  (T, , . . .  ,T  ) 

l  n 

is  RR^  in  pairs.  A  special  case  of  this  fact  was  observed  by  Lehmann 
(1966),  p.  1144.  See  also  Ebrahimi  and  Ghosh  (1980).  By  the  discussion 
after  Theorem  4.3  it  follows  that  this  df  satisfies  the  RR^  condition 


of  Karlin  and  Rtnott  (1980). 
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We  will  show  now  that  T  Is  NDS  hy  proving  that,  for  0  <  t  <  N-l, 

—  —  n  — 


r>.  k  i ) 


<T1 . Vl>lT„  ■  'n1  i  l<Tl . Vl’l'n  ■  *  11 


Then,  by  symmetry,  (2.4')  follows.  Since  the  random  vectors  on  each  side 
of  (5.3.1)  have  multivariate  hypergeometric  df  we  actually  have  to  show 


(5.3.2) 


U  V 


where  U  -  (Uj,...,ll  ^)  has  hypergeometric  df  with  parameters 

(N  +  l ,  M  , . . . ,M  , ,  M  +  1)  and  V  ■  (V.  , .  .  .  ,V  . )  has  the  same  df  with 
1  n-l  —  1  n-l 

parameters  (N,  , ,  M)  where  N  ■  N  -  t  -  1  and  M  -  M  -  t  -  1. 

1  n-l  n  n 

Denote  m  *  n  -  1 . 

Thinking  about  as  the  number  of  Individuals  in  the  sample  in 

Category  i  (i  =  1 . m)  it  is  easy  to  see,  by  conditioning  on  the  category 

of  the  first  individual  chosen,  that 

(U1,...,Um)  S-  (wjl),...tw£{,  W<*>  +  1,  wj*} . W<‘>)  if  the  first 

s  t 

individual  is  in  category  l,  i  •  l,2,...,m,  and  that  (U,,...,U  )  •  (V,,...,V 

I  m  J  Tn 

If  the  first  individual  is  in  neither  of  categories  l,2,..,,m,  where 

( 0 )  (l) 

(W  ...,W  )  has  a  multivariate  hypergeometric  df  with  parameters 

1  m 

(N,  j ,  M-l,  M).  Thus  for  proving  (5.3.2)  it 

suffices  to  show  that 


(5.3.3)  (W [l) . W<°+1,  W . W<°)  Sl  (Vx . Vm),  «,  -  1 . m. 


We  will  prove  (5.3.3)  when  £  ■  1;  the  proof  for  the  other  I's  is 
similar.  Omit  the  superscript  1  and  consider  (W^,...,Wm)  ■  (W^\...,W^  ^ 
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which  has  a  hypergeometr io  distribution  with  parameters 

(r;>  V1’  . vr  V  M)* 


Write 


M 


V-l 


P(V  -  v ...... V  «  v  ) 

11  mm 


mi » • • • ^  i 

i-i  i 


/ 


'1 . vm’  N  "  j 


i-1 


'  M  -  N  ^ 

M1  “  V"**  -  V  »*  -  N  -  l  +  l  Vi 

i-1  i=l 

N  / 


Then,  it  is  easily  seen,  that  the  rv’s  » • • • »vm  can  have  the  following 

interpretation:  first  divide  a  population  of  size  M  into  a  group  of  size 

N  and  another  group  of  size  M  -  N.  Next,  choose  at  random  individuals 

and  let  be  the  number  of  them  in  the  first  group  (of  size  N) ,  then 

choose  at  random  individuals  of  the  remaining  M  -  individuals  and 

let  V2  be  the  number  of  them  in  the  first  group.  Continuing  this  way, 

finally  choose  M  individuals  of  the  remaining  M  -  M.  -  ...  -  M  . 

individuals  and  let  V  be  the  number  of  them  in  the  first  group. 

m 

The  rv's  W, , . . . ,W  may  have  a  similar  interpretation. 

1  m 

With  these  interpretations  it  is  easily  seen  that  if  the  first 
individual  chosen  in  the  'V  experiment.'  is  not  in  the  first  group  then 

V  =l  W  S<C  (W,  +  1 ,  W„ , . . .  ,W  ) ,  where  the  inequality  follows  from  (1.2). 

------  1  l  m 

otherwise  (Vt . Vj  S«  (Wj  +  1 ,  W2,...,Wm>.  Thus,  unconditionally 

v  (Wj  +  l,...,Wm),  which  proves  (4.3.3)  when  9  -  1, 


5.4. 


The  Dirichlet  distribution. 


I. ot  T  =  (T.,...,T  )  have  the  density 
—  In 


r<  I  V 


f  ( t t  )  = 
l  n  n 


L  i ' 
1-0  2 


n  8-1  n  9,-1 

o  -  It,)0  ,  t,  J 


*  r<o  ) 

i-o  2 


l  ti)  .\  t*  ’  ci  1  °.  I  i  !• 

j-1  2  1=1  1  2  j-1  2 


whore  the  parameter  vector  (0^,0^ , . . . ,0^)  satisfies  0^  >  1,  j  =  0,1,. ..,n. 

The  Dirichlet  df  is  the  conditional  df  of  independent  gamma  rv’s 
given  their  sum.  Thus,  hy  Theorem  4.3  the  Dirichlet  df  is  RR2  in  pairs 
and  hence  it  is  also  CDS,  NUOD  and  NLOD.  By  Remark  (iii)  f  is  RR2  in 
pairs.  Special  case  of  this  fact  is  Example  10  (iii)  of  Lehmann  (1966).  See 
also  Ehrahimi  and  Ohosh  (1980).  By  the  discussion  after  Theorem  4.3  it 
follows  that  this  df  satisfies  the  RR2  condition  of  Karlin  and  Rinott 


(1980). 


To  prove  that  T  is  NDS  it  is  enough  to  show  that  for  t  <  t' 
-  n  —  n 


<5.4.1)  (<Tj . T„.1)hn  -  t„l  V  (<Tj, 


•Vi>iTn  - 


then  (2.4')  follows. 


It  is  easily  seen  that 


<i-cn)-‘i<T1,....Tn_1>hn  .  t„]  s-'  u-erhcr, . vplr,  -  t;i 


Since  1  -  t  >  1  —  t  *  (5.4.1)  follows  from  (1.4). 

n  -  n 


5.5.  Dirichlet  compound  multinomial  df. 


Let  T  -  (T,,...,T  )  have  the  probability  function 
-  I  n 


. .V-  X  r.’ 


l'(T 


i  lr 


,t  =  t  >  = 

n  n 


N:  r,J „V  .  rc.«.> 


n 

r(N  +  l  o  ) 

i=o  1 


1=1  Vr(V 


3? 


r(N  -  j'j  +  V 

- f -  & i-"' 

(N  -  F  t4)!r(9j  J 


J=i 


where  N  is  a  positive  integer  and  0^  >_  1,  j  ■  0,1,. . . ,n  [see  Johnson 
and  Kotz  (1969 ) ] . 

The  Dirichlet  compound  multinomial  df  is  the  conditional  df  of 
independent  Pascal  (negative  binomial)  rv's  given  their  sum.  Thus,  hv 
Theorem  4.3  this  df  is  RR^  in  pairs  and  hence  it  is  also  CDS,  NUOD 
and  NLOn,  and  by  the  remark  after  Theorem  4.3  it  satisfies  the  RR^ 
condition  of  Karlin  and  Rinott  (1980). 

To  show  that  T  is  NDS  it  suffices  to  prove  that 


(5.5.1) 


[  (T ...... T  .  )  |  T  *  t  ]  t 

1  n-i  n  n  n 


then  (2.4')  follows  by  symmetry.  But  the  random  vector  in  (5.5.1)  has 

the  Dirichlet  compound  multinomial  df  with  parameters  N  -  t^, 

0„,...,0  Using  this  fact  it  is  easy  to  verify  (5.5.1). 

0  n-i 


5.6.  Df's  supported  by  negatively  tilted  surfaces. 

In  this  subsection  we  illustrate  two  simple  examples  of  such  df's  in 
IK3.  The  general  idea  then  should  be  clear  and  will  not  be  detailed  here. 

I.et  (T]  »Tj  ,T^)  have  the  uniform  df  on  the  surface 
{(trt2,t3)  1  (0,0,0):  tx  +  t2  +  t3  -  1).  Then  f(Tj  ,T2)|T3  -  tj]  has 


1,111  .1  uiipuiii 

1 1  ■  i  r  ...  ■  —  ....  .iff)  maagaum 
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a  uniform  df  on  the  segment  >  (0,0):  +  tj  “  1  -  t^) 

which  is  clearly  stochastically  decreasing  in  t^.  By  symmetry  (2.4') 
holds,  hence  (Ti,T2,T3^  is  NDS .  Clearly  (Ti»^2’T3^  is  also  CDS. 

Similarly,  if  (Ti*T2,,p3^  have  the  uniform  df  on  the  surface  of 
the  unit  sphere  intersected  with  the  positive  (or  the  negative)  orthant 
then  (T1(T2,T3)  is  NDS  and  CDS. 

5.7.  NUOD  df's  in  reliability  theory. 

Buchanan  and  Singpurwalla  (1977)  considered  the  class  of  nonnegative 
multivariate  new  better  than  used  (NBU)  df's  that  satisfy,  for  all 
s  >  0,  £  >  0, 

0.7.1)  PIT,  >  S)  ♦  . Tn  >  So  +  tn)  iP(T,  »  .j . Tn  »  .„) 

“  r(Tl  >  h . T„  *  '„>• 

It  is  well  known  that,  when  n  *  2,  (5.7.1)  implies  that  are 

negatively  quadrant  dependent,  that  is 


(5.7.2) 


P(T1  >  ti»  T2  *  t2)  iP(Tl  >  tl)  P(T2  >  t2)> 


h'('  will  show,  hy  induction,  that  in  general  (5.7.1)  implies  that 

(T, . T  )  is  NUOD.  From  (5.7.2)  we  know  it  is  true  when  n  -  2.  Assume 

I  n 

that 


P(Tj 


,T 


n-1 


> 


t 

n- 


1 


) 


< 


n-1 

*  P(T  >  t.). 
i-1 


(5.7.3) 


Substitute 


Lo  obtain 
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